Detection of Protein Catalytic Sites in the Biomedical Literature

Authors

  • Karin M. Verspoor
  • Andrew MacKinlay
  • Judith D. Cohn
  • Michael E. Wall
Abstract

This paper explores the application of text mining to the problem of detecting protein functional sites in the biomedical literature, and specifically considers the task of identifying catalytic sites in that literature. Through an analysis of two corpora in terms of their coverage of curated data sources, we provide strong evidence for the need for text mining techniques that address residue-level protein function annotation. We also explore the viability of building a text-based classifier for identifying protein functional sites, and identify the low coverage of curated data sources and the potential ambiguity of information about protein functional sites as challenges that must be addressed. Nevertheless, we produce a simple classifier that achieves a reasonable ∼69% F-score on our full-text silver corpus in the first attempt to address this classification task. The work has application in computational prediction of the functional significance of protein sites as well as in curation workflows for databases that capture this information.

Related articles

Expression of a Chimeric Protein Containing the Catalytic Domain of Shiga-Like Toxin and Human Granulocyte Macrophage Colony-Stimulating Factor (hGM-CSF) in Escherichia coli and Its Recognition by Reciprocal Antibodies

Fusion of two genes at the DNA level produces a single protein, known as a chimeric protein. Immunotoxins are chimeric proteins composed of specific cell-targeting and cell-killing moieties. Bacterial or plant toxins are commonly used as the killing moieties of chimeric immunotoxins. In this investigation, the catalytic domain of Shiga-like toxin (A1) was fused to human granulocyte macrophage ...

Study of PKA binding sites in cAMP-signaling pathway using structural protein-protein interaction networks

Background: Protein-protein interaction plays a key role in signal transduction in signaling pathways. Different approaches are used to predict these interactions, including experimental and computational approaches. In conventional node-edge protein-protein interaction networks, we can only see which proteins interact, but ‘structural networks’ show us how these proteins inter...

Novel Bi-allelic PDE6C Variant Leads to Congenital Achromatopsia

Background: The clinical phenotyping of patients with achromatopsia harboring variants in phosphodiesterase 6C (PDE6C) has been poorly described in the literature. PDE6C encodes the catalytic subunit of the cone phosphodiesterase, which hydrolyzes the cyclic guanosine monophosphate that proceeds with the hyperpolarization of photoreceptor cell membranes, as the final step of the phototransduct...

Effects of Confinement in Carbon Nanotubes on the Performance and Lifetime of Fischer-Tropsch Iron Nano Catalysts

The effects of confinement in carbon nanotubes on Fischer-Tropsch (FT) activity, selectivity and lifetime of Carbon NanoTubes (CNTs) supported iron catalysts are reported. A method was developed to control the position of the catalytic sites on either inner or outer surface of carbon nanotubes. TEM analyses revealed that more than 80% of iron oxide particles can be controlled to be position...

Extraction of Drug-Drug Interaction from Literature through Detecting Linguistic-based Negation and Clause Dependency

Extracting biomedical relations such as drug-drug interaction (DDI) from text is an important task in biomedical NLP. Due to the large number of complex sentences in biomedical literature, researchers have employed sentence simplification techniques to improve the performance of relation extraction methods. However, due to the difficulty of the task, there is no noteworthy improvement in t...

Journal:
  • Pacific Symposium on Biocomputing

Publication date: 2013